Computer and Modernization ›› 2012, Vol. 198 ›› Issue (2): 31-34.doi: 10.3969/j.issn.1006-2475.2012.02.009

• 算法设计与分析 • Previous Articles     Next Articles

A Text Clustering Algorithm for Short Message

WU Yong, XU Feng   

  1. Department of Information Engineering, Hunan Mechanical & Electrical Polytechnic, Changsha 410151, China
  • Received:2011-10-27 Revised:1900-01-01 Online:2012-02-24 Published:2012-02-24

Abstract: As to short message text clustering, this paper designs a hybrid clustering algorithm combining by frequent termsets and Ant-Tree algorithm. This algorithm takes the advantage of efficiency of processing text data based on the frequent termsets clustering, produces the initial cluster, then eliminates the overlap text documents by calculating silhouette coefficient. Further refines the cluster by Ant-Tree. Thus gets the high quality clustering results. And the results that retain the description and tree structure can provide wider applications.

Key words: frequent term-sets, Ant-Tree algorithm, silhouette coefficient, short message, clustering

CLC Number: